NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multiscale Differential Geometry Learning for Protein Flexibility Analysis

https://doi.org/10.1002/jcc.70073

Feng, Hongsong; Zhao, Jeffrey Y; Wei, Guo‐Wei (March 2025, Journal of Computational Chemistry)

ABSTRACT Protein structural fluctuations, measured by Debye‐Waller factors or B‐factors, are known to be closely associated with protein flexibility and function. Theoretical approaches have also been developed to predict B‐factor values, which reflect protein flexibility. Previous models have made significant strides in analyzing B‐factors by fitting experimental data. In this study, we propose a novel approach for B‐factor prediction using differential geometry theory, based on the assumption that the intrinsic properties of proteins reside on a family of low‐dimensional manifolds embedded within the high‐dimensional space of protein structures. By analyzing the mean and Gaussian curvatures of a set of low‐dimensional manifolds defined by kernel functions, we develop effective and robust multiscale differential geometry (mDG) models. Our mDG model demonstrates a 27% increase in accuracy compared to the classical Gaussian network model (GNM) in predicting B‐factors for a dataset of 364 proteins. Additionally, by incorporating both global and local protein features, we construct a highly effective machine‐learning model for the blind prediction of B‐factors. Extensive least‐squares approximations and machine learning‐based blind predictions validate the effectiveness of the mDG modeling approach for B‐factor predictions.
more » « less
Free, publicly-accessible full text available March 15, 2026
Mayer-Homology Learning Prediction of Protein-Ligand Binding Affinities

https://doi.org/10.1142/S2737416524500613

Feng, Hongsong; Shen, Li; Liu, Jian; Wei, Guo-Wei (March 2025, Journal of Computational Biophysics and Chemistry)

Artificial intelligence-assisted drug design is revolutionizing the pharmaceutical industry. Effective molecular features are crucial for accurate machine learning predictions, and advanced mathematics plays a key role in designing these features. Persistent homology theory, which equips topological invariants with persistence, provides valuable insights into molecular structures. The standard homology theory is based on a differential rule for the boundary operator that satisfies [Formula: see text] = 0. Our recent work has extended this rule by employing Mayer homology with generalized differentials that satisfy [Formula: see text] = 0 for [Formula: see text] 2, leading to the development of persistent Mayer homology (PMH) theory and richer topological information across various scales. In this study, we utilize PMH to create a novel multiscale topological vectorization for molecular representation, offering valuable tools for descriptive and predictive analyses in molecular data and machine learning prediction. Specifically, benchmark tests on established protein-ligand datasets, including PDBbind-v2007, PDBbind-v2013, and PDBbind-v2016, demonstrate the superior performance of our Mayer homology models in predicting protein-ligand binding affinities.
more » « less
Free, publicly-accessible full text available March 1, 2026
Persistent Sheaf Laplacian Analysis of Protein Flexibility

https://doi.org/10.1021/acs.jpcb.5c01287

Hayes, Nicole; Wei, Xiaoqi; Feng, Hongsong; Merkurjev, Ekaterina; Wei, Guo-Wei (April 2025, The Journal of Physical Chemistry B)
Persistent Directed Flag Laplacian (PDFL)-Based Machine Learning for Protein–Ligand Binding Affinity Prediction

https://doi.org/10.1021/acs.jctc.5c00074

Zia, Mushal; Jones, Benjamin; Feng, Hongsong; Wei, Guo-Wei (April 2025, Journal of Chemical Theory and Computation)
CAML: Commutative Algebra Machine Learning─A Case Study on Protein–Ligand Binding Affinity Prediction

https://doi.org/10.1021/acs.jcim.5c00940

Feng, Hongsong; Suwayyid, Faisal; Zia, Mushal; Wee, JunJie; Hozumi, Yuta; Chen, Chun-Long; Wei, Guo-Wei (June 2025, Journal of Chemical Information and Modeling)
Knot data analysis using multiscale Gauss link integral

https://doi.org/10.1073/pnas.2408431121

Shen, Li; Feng, Hongsong; Li, Fengling; Lei, Fengchun; Wu, Jie; Wei, Guo-Wei (October 2024, Proceedings of the National Academy of Sciences)

In the past decade, topological data analysis has emerged as a powerful algebraic topology approach in data science. Although knot theory and related subjects are a focus of study in mathematics, their success in practical applications is quite limited due to the lack of localization and quantization. We address these challenges by introducing knot data analysis (KDA), a paradigm that incorporates curve segmentation and multiscale analysis into the Gauss link integral. The resulting multiscale Gauss link integral (mGLI) recovers the global topological properties of knots and links at an appropriate scale and offers a multiscale geometric topology approach to capture the local structures and connectivities in data. By integration with machine learning or deep learning, the proposed mGLI significantly outperforms other state-of-the-art methods across various benchmark problems in 13 intricately complex biological datasets, including protein flexibility analysis, protein–ligand interactions, human Ether-à-go-go-Related Gene potassium channel blockade screening, and quantitative toxicity assessment. Our KDA opens a research area—knot deep learning—in data science.
more » « less
Full Text Available
Multiscale differential geometry learning of networks with applications to single-cell RNA sequencing data

https://doi.org/10.1016/j.compbiomed.2024.108211

Feng, Hongsong; Cottrell, Sean; Hozumi, Yuta; Wei, Guo-Wei (March 2024, Computers in Biology and Medicine)

Full Text Available
ChatGPT in Drug Discovery: A Case Study on Anticocaine Addiction Drug Development with Chatbots

https://doi.org/10.1021/acs.jcim.3c01429

Wang, Rui; Feng, Hongsong; Wei, Guo-Wei (November 2023, Journal of Chemical Information and Modeling)

Full Text Available
Multiobjective Molecular Optimization for Opioid Use Disorder Treatment Using Generative Network Complex

https://doi.org/10.1021/acs.jmedchem.3c01053

Feng, Hongsong; Wang, Rui; Zhan, Chang-Guo; Wei, Guo-Wei (September 2023, Journal of Medicinal Chemistry)

Full Text Available
Machine learning study of the extended drug–target interaction network informed by pain related voltage-gated sodium channels

https://doi.org/10.1097/j.pain.0000000000003089

Chen, Long; Jiang, Jian; Dou, Bozheng; Feng, Hongsong; Liu, Jie; Zhu, Yueying; Zhang, Bengong; Zhou, Tianshou; Wei, Guo-Wei (January 2024, Pain)

Abstract Pain is a significant global health issue, and the current treatment options for pain management have limitations in terms of effectiveness, side effects, and potential for addiction. There is a pressing need for improved pain treatments and the development of new drugs. Voltage-gated sodium channels, particularly Nav1.3, Nav1.7, Nav1.8, and Nav1.9, play a crucial role in neuronal excitability and are predominantly expressed in the peripheral nervous system. Targeting these channels may provide a means to treat pain while minimizing central and cardiac adverse effects. In this study, we construct protein–protein interaction (PPI) networks based on pain-related sodium channels and develop a corresponding drug–target interaction network to identify potential lead compounds for pain management. To ensure reliable machine learning predictions, we carefully select 111 inhibitor data sets from a pool of more than 1000 targets in the PPI network. We employ 3 distinct machine learning algorithms combined with advanced natural language processing (NLP)–based embeddings, specifically pretrained transformer and autoencoder representations. Through a systematic screening process, we evaluate the side effects and repurposing potential of more than 150,000 drug candidates targeting Nav1.7 and Nav1.8 sodium channels. In addition, we assess the ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties of these candidates to identify leads with near-optimal characteristics. Our strategy provides an innovative platform for the pharmacological development of pain treatments, offering the potential for improved efficacy and reduced side effects.
more » « less
Full Text Available

« Prev Next »

Search for: All records